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Abstract -Understanding the statistical properties of recurrence intervals of extreme events is 
crucial to risk assessment and management of complex systems. The probability distributions 
and correlations of recurrence intervals for many systems have been extensively investigated. 
However, the impacts of microscopic rules of a complex system on the macroscopic properties of 
its recurrence intervals are less studied. In this Letter, we adopt an order-driven stock market 
model to address this issue for stock returns. We find that the distributions of the scaled recurrence 
intervals of simulated returns have a power law scaling with stretched exponential cutoff and the 
intervals possess multifractal nature, which are consistent with empirical results. We further 
investigate the effects of long memory in the directions (or signs) and relative prices of the order 
flow on the characteristic quantities of these properties. It is found that the long memory in 
the order directions (Hurst index H 3 ) has a negligible effect on the interval distributions and the 
multifractal nature. In contrast, the power-law exponent of the interval distribution increases 
linearly with respect to the Hurst index H x of the relative prices, and the singularity width of the 
multifractal nature fluctuates around a constant value when H x < 0.7 and then increases with 
H x . No evident effects of H s and H x are found on the long memory of the recurrence intervals. 
Our results indicate that the nontrivial properties of the recurrence intervals of returns are mainly 
caused by traders' behaviors of persistently placing new orders around the best bid and ask prices. 



Introduction. — Most natural and socioeconomic 
systems are complex systems in which extreme events 
occur more frequently than Gaussian and exhibit com- 
plex behaviors p~H2]. Such extreme events usually cause 
tremendous damages on the systems, incurring losses of 
lives, belongings and properties. Hence, understanding 
the behaviors of extreme events is crucial to the risk as- 
sessment and management of complex systems. However, 
it is not an easy task due to the lack of statistics of ex- 
treme events. One strategy to overcome this difficulty is 
to investigate the evolution of the statistical properties of 
recurrence intervals of less extreme events and extreme 
events and make inference by extrapolations [JHZ]- 
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The recurrence interval analysis has been performed on 
different time series recorded in diverse systems, including 
simulated time series with long-term correlations [4"ll8"ttT5"] , 
climate records [SE] , earthquakes [T6h"T8] , heartbeats [19] , 
energy dissipation rate in three-dimensional fully devel- 
oped turbulence [5D], and volatilities [TI I!n"H3"4] . returns 
[T^1[55MD] . and trading volumes [4TTH3"] in financial mar- 
kets. For financial returns, the recurrence intervals are dis- 
tributed in power-law forms and possess long-term mem- 
ories [T21l55rl59"] . These empirical results are crucial to the 
risk assessment of large price fluctuations jT2 , 35 , 38 . 3 91I44J . 

The effects of linear and nonlinear long-term correla- 
tions of the original time series on the statistical properties 
of the recurrence intervals have been investigated, which 
is at the macroscopic level. It is found that the linear 
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long-term correlations in the original time series cause the 
long memory in the recurrence intervals [7] . If the original 
records contain multifractal nature, the distribution of the 
recurrence intervals decays as a power law |12j ; Otherwise, 
the distribution has a Wcibull form. However, an inves- 
tigation of the microscopic factors that lead to those sta- 
tistical properties of recurrence intervals still lacks. This 
Letter contributes to this topic to understand the effects 
of long memory in the order submission process on the 
properties of recurrence intervals of stock returns using a 
microscopic stock trading model. 

The price return is defined as the logarithmic difference 
of stock prices. We normalize the return records by divid- 
ing its standard deviation to obtain the normalized return 
time series R(t). The recurrence interval rg is the wait- 
ing time between two consecutive normalized price returns 
above a positive threshold Q > 0. Since the distributions 
of recurrence intervals for positive and negative thresh- 
olds arc symmetric, we present the results for the return 
intervals between large positive price returns. 

Modified Mike-Farmer model. There are var- 
ious microscopic models for stock price formation pro- 
cesses, most of which are agent-based models constructed 
according to intuitive and/or empirical regularities and 
designed for quote-driven market [45] . There are also 
order-driven market models [46H49| , and to our knowl- 
edge, the empirical behavioral model proposed by Mike 
and Farmer is the first and most realistic model |50j . The 
Mike-Farmer model mimics the processes of order place- 
ment and order cancelation, and orders are executed ac- 
cording to the price-time priority rule. The processes of 
order placement and cancelation are simulated based on 
the statistical properties extracted from real order flows 
of stocks. To submit an order, one needs to determine its 
direction (buy or sell), price and size. To cancel an or- 
der on the limit order book, one needs to identify possible 
factors causing a trader to withdraw his/her submitted or- 
ders. The framework of the Mike- Farmer model opens a 
promising direction for computational experimental fiance 
since it allows us to understand the macroscopic properties 
of stocks from the realistic microscopic level. 

The Mike-Farmer model integrates two statistical reg- 
ularities |50j . i.e., the long memory of order directions 
characterized by its Hurst index H s and the Student dis- 
tribution of the relative logarithmic order price x, which 
is defined as the logarithmic difference of the order price 
to the same best price. Let w(t) be the logarithmic price 
of a submitted order at time t, and 7Tb (t) and TT a (t) be 
the logarithms of the current best bid price and best ask 
price. The relative price x(t) at time t can be defined as 
x(t) = ir(t) — 7Tb (i) for buy orders and x(t) = ir a (t) — ir(t) 
for sell orders. This model is able to reproduce excellent 
return distributions at different time scales comparable to 
real data and the absence of long memory in the return 
time series [50ll5"T] . 

However, the Mike-Farmer model fails to reproduce the 
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Fig. 1: Realization segments of standardized returns R(t) simu- 
lated from the modified Mike- Farmer model for different values 
of H s and H x : (a) H x = 0.5 and H s = 0.5, (b)H x = 0.75 and 
H s = 0.5, (c) H x = 0.5 and H s = 0.75, and (d) H x = 0.75 and 
H s = 0.75. 



long- memory property of volatility |50j . After further in- 
cluding in the model another stylized fact that the relative 
prices of submitted orders are long-term correlated, Gu 
and Zhou proposed a modified Mike-Farmer model which 
is able to capture nicely the long memory in the volatility 
without distorting the power-law tails in the return dis- 
tributions and the absence of memory in the return series 
[52] . The return time series generated from the modified 
Mike- Farmer model also exhibit the well-known multifrac- 
tal nature [52] . It is also found that a Poisson cancelation 
process is sufficient to obtain these stylized facts [5"l~ll5"2] . 
In this Letter, we adopt this modified Mike-Farmer model. 
Specifically, the order directions are simulated from the 
signs of the increments of a fractional Brownian motion of 
Hurst index H s . The relative order prices are drawn from 
a Student distribution and then manipulated to introduce 
long-term correlations of Hurst index H x using the ampli- 
tude adjusted Fourier transform algorithm [53] . The order 
size is fixed to be unity [50H52] . 

In our simulations, the distribution of the relative prices 
is fixed to be a symmetric Student distribution with the 
freedom degree being 1.3 and the scale parameter 0.0024 
[50] . Actually, our numerical experiments indicate that 
these parameters have negligible impacts on the statisti- 
cal properties of the recurrence intervals. We vary the 
values of H s and H x from 0.5 to 0.9. For each round of 
simulation, there are 2 x 10 6 steps, and about 500,000 re- 
turns are recorded for analysis. For each pair of H s and 
H x , five rounds of simulations are performed and the re- 
sults presented below are averaged with these repeated 
simulations. Figure [T] illustrates four segments of return 
realizations for different values of H s and H Xl where the 
original returns are standardized. 
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Fig. 2: (Color online) Scaled probability distributions of the 
return recurrence intervals above positive thresholds Q — 1,2,3 
and 4 for four pairs of H„ and H x : (a) H x = 0.5 and H„ = 0.5, 
(b)H x = 0.75 and H s = 0.5, (c) H x = 0.5 and H a = 0.75, and 
(d) H x = 0.75 and H s = 0.75. The solid lines are the fits to 
the generalized Gamma function. 



Scaled PDF of recurrence intervals. We first 
investigate the probability distribution functions (PDFs) 
Pq(t) of return intervals. For each group of model param- 
eters, we perform five rounds of simulations and calculate 
five time series of recurrence intervals for different thresh- 
olds Q. For each Q, the interval time series are divided 
by their corresponding mean values (r) and put them to- 
gether. We find that there is no asymmetry between the 
two recurrence interval PDFs for Q and — Q, which is con- 
sistent with empirical data [38]. We thus present the re- 
sults for positive thresholds Q > 0. Figure [2] illustrates 
the scaled PDFs {t)Pq{t) as a function of scaled return 
intervals r/ (r) above four thresholds Q = 2, 3, 4 and 5 for 
four pairs of H x and H s . 

As we can see in each plot of Fig. [2] the scaled PDFs 
for different thresholds Q differ from each other for small 
r/ (r). When r j (r) > 10 -2 , the PDFs in each plot collapse 
onto a single master curve for different thresholds. In other 
words, we have 



(r)P Q (r)=p(r/(r)), 



(1) 



where p{x) is a master curve independent of Q for large 
r/(r). We find that p(x) can be well fitted using a gener- 
alized Gamma function 



P 



(x) =Ax-Pe-~< x \ 



(2) 



where j3, 7 and S are parameters and A is the normaliza- 
tion factor 

We fitted the generalized Gamma distribution for different 
values of H x and H s . There is no clear evidence that 7 



Table 1: Fitted power-law exponents /3 in Eq. ([2J for different 
values of H x and H a . 
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0.60 
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0.70 


0.694 


0.717 


0.730 


0.643 


0.677 


0.80 


0.774 


0.803 


0.773 


0.759 


0.744 


0.90 


0.835 


0.868 


0.858 


0.823 


0.785 



and S depends on H x or H s . In contrast, Table Q] shows 
that the power-law exponent (3 increases with H x for each 
H s . For fixed H x , the relation between j3 and H s is quite 
complicated. When the relative prices are uncorrelated 
(H x ps 0.5), the exponent (3 increases with H s . When 
the relative prices are positively correlated (H x > 0.5), /3 
shows an inverse V-shape with respect to H s . 

The dependence of (3 on H x can be explained qualita- 
tively. Numerical simulations show that the Hurst index 
of the absolute returns increases with H x [52]. Hence, the 
clustering phenomenon becomes more significant for large 
H x , as illustrated in Fig. [T] Fixing the threshold Q, there 
are more small recurrence intervals and less large recur- 
rence intervals for large H x . Therefore, the distribution 
of the recurrence intervals decays faster for larger H x , re- 
sulting in a larger power-law exponent /3. On the contrary, 
there is no clear dependence between the Hurst index of 
absolute returns and the memory effect of order directions 
[52j . This might explain the ambiguous relationship be- 
tween j3 and H s . 

If we plot (3 against H x for fixed H s , we observe nice 
linear relationship, as shown in Fig. [3] We fit the data 
points in Fig. [3] to the following linear equation 
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Fig. 3: (Color online) Dependence of the power-law exponent 
P on H x and H s . The planar grid is the best fit of the data 
points to j3 = a + bH x + cH s , where a = 0.243, b — 0.665 and 
c = -0.0174. 
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Fig. 4: (Color online) Detrended fluctuation functions F(s) the 
return recurrence intervals above positive thresholds Q — 1,2,3 
and 4 for four pairs of H s and H x : (a) H x = 0.5 and H s = 0.5, 
(b)H x = 0.75 and H a = 0.5, (c) H x = 0.5 and H s = 0.75, and 
(d) H x = 0.75 and H s = 0.75. The curves have been vertically 
shifted for better visibility. 

We find that this equation can be accepted at the signif- 
icance level of 5%. The fitted coefficients are a = 0.243, 
b = 0.665 and c = —0.0174. A t-test for the coefficients 
a, b and c gives that the p- values are p a = Pb = 0.00 and 
p c = 0.60. It means that the coefficient b is significantly 
different from zero while c is insignificant. These results 
are consistent with the observations from table [1] 

Long-term correlation in recurrence intervals. 

There is evidence showing that the recurrence intervals of 
positive and negative stock returns exhibit long-term cor- 
relations [38.. Although the return time series is uncor- 
rected, the time series of all positive or negative returns 
are correlated. This long-term correlation is the source of 
the long-term correlation in the recurrence intervals of re- 
turns [35] • This is similar as for financial volatility, where 
the long-term correlation in the intervals stems from the 
long-term correlation in the volatility series [7] . 

To investigate the long-term correlation in the recur- 
rence intervals, we adopt the detrended fluctuation anal- 
ysis (DFA) method [54H58] . The DFA method calculates 
the detrended fluctuation function F(s) of the time series 
within a window of s points after removing a linear trend. 
For long-term power-law correlated time series, F(s) is ex- 
pected to scale as a power law with respect to the time 
scale s 

F( S )~s H . (5) 

For each group of the input parameters, there are five 
series of return intervals for each Q. We calculate their 
Hq,i (i = 1, 2, 3, 4, 5) separately and average them 

1 5 

H Q = T^2 H Q,i- (6) 



Fig. 5: (Color online) Variation of the estimated DFA expo- 
nents Hq of recurrence interval series above different thresholds 
Q = 1 (a), Q = 2 (b), Q = 3 (c) and Q = 4 (d) with respect to 
H x for different H 3 . 

FigureSlplots in double logarithmic scales the detrended 
fluctuation functions F(s) of the return recurrence inter- 
vals above positive thresholds Q = 2, 3, 4 and 5 with 
respect to the scale s for four typical pairs of H s and H x . 
Power-law relations are observed and the associated DFA 
exponents Hq can be determined by linear regressions. 

Figure [5] shows the dependence of the estimated DFA 
exponents Hq of recurrence interval series above different 
thresholds with respect to H x for different H s . Wc find 
that the recurrence intervals exhibit weak memory effects 
since the Hq values are only slightly larger than 0.5. A 
careful scrutiny unveils that, for fixed H x , Hq reaches a 
maximum when H s is close to 0.7. The DFA exponents 
Hq for the recurrence intervals of the shuffled return scries 
fluctuate around 0.5 and the large Hq values are found to 
be significantly larger than 0.5. The observation of weak 
memory effects in the simulated recurrence intervals is in- 
consistent with the empirical finding [38] , which indicates 
that certain important ingredients are missing in the mod- 
ified Mike-Farmer model. 

Multifractality in recurrence intervals. To our 

knowledge, the multifractal nature of recurrence intervals 
has only been investigated for energy dissipation rates 
|20j . Here, we adopt the multifractal detrended fluctua- 
tion analysis approach [59| to investigate the multifractal 
nature of the simulated interval series. We expect that the 
qth order fluctuation function F q (s) scales with respect to 
the box size s as a power law 

F,(*)~* h <«>. (7) 

We can calculate the scaling exponents r(q) as follows 

T(q) = qh(q) - 1. (8) 

If h(q) is a nonlinear function of q (thus r{q) is also a 
nonlinear function of q), the recurrence intervals exhibit 
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Fig. 6: (Color online) Multifractal analysis of recurrence inter- 
vals for Q = 4. (a) MFDFA exponents h(q) for typical pairs 
of H x and H B . (b) Scaling exponents r{q) calculated from the 
MFDFA exponents in plot (a), (c) Singularity spectrum /(a) 
calculated from the MFDFA exponents in plot (a), (d) Depen- 
dence of singularity width Aa on H x and H s . 

multifractal nature. We can also calculate the singularity 
spectra by 

f(a) = q[a-h(q)] + l, (9) 

where a = h(q) + qh'(q) is the singularity strength. A 
broad spectrum of singularity indicates the presence of 
multifractality. Plots (a), (b) and (c) of Fig. [5] illustrate 
respectively the h(q), r(q) and /(a) functions for several 
pairs of H x and H s for the threshold Q = 4, in which 
each curve is averaged over five round of simulations. All 
these plots show that the recurrence interval series exhibit 
solid multifractal nature. According to Fig. [6jc), there 
are also negative /(a) values, known as negative or la- 
tent dimensions, which stem from random multifractals 
or lower-dimensional cuts of multifractal objects [rJCJM62j . 

In plots (a) to (c) of Fig. [6j we also notice that the 
curves differ from one another, especially for large val- 
ues of H x , which indicates that the multifractal nature of 
the associated recurrence intervals have different strength. 
We adopt the width of the singularity spectrum Aa to 
quantify the multifractal strength [55] . The larger the 
singularity width, the stronger the multifractality. Fig- 
ure [6jd) illustrates the dependence of Aa with respect to 
H x for different H s . No clear dependence is found be- 
tween Aa and H s . On the contrary, Aa remains stable 
when 0.5 < H < 0.7 and increases for larger H x . The 
results are qualitatively the same for other values of Q. 

Conclusions. — In summary, we have investigated the 
effects of long memory in the order submission process on 
the statistical properties of recurrence intervals of large 
price fluctuations through numerical experiments based 
on an order-driven stock trading model [52 . 



We found that the distributions of the scaled recur- 
rence intervals of simulated returns can be well fitted with 
the generalized Gamma distribution, a power law with a 
stretched exponential decay. The Hurst index H x of the 
relative prices of submitted orders is found to have a sig- 
nificant impact on the power-law exponent f3, while the 
Hurst index H s of order directions does not. Specifically, 
the power-law exponent j3 increases linearly with the Hurst 
index H x . 

We also found that the recurrence intervals have weak 
long-term correlations based on the detrended fluctuation 
analysis, which is only partially consistent with empirical 
findings [35] . No clear effects of H x and H s were observed 
on the Hurst index (or more precisely the DFA exponent) 
Hq of the recurrence intervals. 

We further confirmed that the intervals possess multi- 
fractal nature. It is found that the long memory in the 
order directions has a negligible effect on the multifractal 
nature of recurrence intervals. In contrast, the singularity 
width of the multifractal nature fluctuates around a con- 
stant value when H x < 0.7 and then increases for larger 
values of H x . 

These findings indicate that the nontrivial properties 
of the recurrence intervals of returns are mainly caused 
by traders' behaviors of persistently placing new orders 
around the best bid and ask prices. It also implies that the 
original Mike-Farmer model [3U] is not capable of investi- 
gating the topic in this Letter since it does not contain the 
ingredient of long memory in relative prices of submitted 
orders. We stress that the observation of weak memory ef- 
fect in the simulated recurrence intervals calls for further 
improvements of the modified Mike- Farmer model |52j . 
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